Fine-Tuning GPT-3 for Russian Text Summarization

نویسندگان

چکیده

Automatic summarization techniques aim to shorten and generalize information given in the text while preserving its core message most relevant ideas. This task can be approached treated with a variety of methods, however, not many attempts have been made produce solutions specifically for Russian language despite existing localizations state-of-the-art models. In this paper, we showcase ruGPT3 ability summarize texts, fine-tuning it on corpora news their corresponding human-generated summaries. Additionally, employ hyperparameter tuning so that model’s output becomes less random more tied original text. We evaluate resulting texts set metrics, showing our solution surpass performance without additional changes architecture or loss function. Despite being able sensible summaries, model still suffers from number flaws, namely, is prone altering Named Entities present (such as surnames, places, dates, etc.), deviating facts stated document, repeating summary. #COMESYSO1120.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Biogeography-Based Optimization Algorithm for Automatic Extractive Text Summarization

    Given the increasing number of documents, sites, online sources, and the users’ desire to quickly access information, automatic textual summarization has caught the attention of many researchers in this field. Researchers have presented different methods for text summarization as well as a useful summary of those texts including relevant document sentences. This study select...

متن کامل

Text Summarization Using Cuckoo Search Optimization Algorithm

Today, with rapid growth of the World Wide Web and creation of Internet sites and online text resources, text summarization issue is highly attended by various researchers. Extractive-based text summarization is an important summarization method which is included of selecting the top representative sentences from the input document. When, we are facing into large data volume documents, the extr...

متن کامل

EXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS

Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...

متن کامل

A survey on Automatic Text Summarization

Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...

متن کامل

Text Summarization Challenge 2 - Text Summarization Evaluation At NTCIR Workshop 3

We describe the outline of Text Summarization Challenge 2 (TSC2 hereafter), a sequel text summarization evaluation conducted as one of the tasks at the NTCIR Workshop 3. First, we describe briefly the previous evaluation, Text Summarization Challenge (TSC1) as introduction to TSC2. Then we explain TSC2 including the participants, the two tasks in TSC2, data used, evaluation methods for each tas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture notes in networks and systems

سال: 2021

ISSN: ['2367-3370', '2367-3389']

DOI: https://doi.org/10.1007/978-3-030-90321-3_61